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Summary. The purpose of this note is to prove a lower bound for the estimation of 
the memory parameter of a stationary long memory process. The memory parameter 
is defined here as the index of regular variation of the spectral density at 0. The 
rates of convergence obtained in the literature assume second order regular variation 
of the spectral density at zero. In this note, we do not make this assumption, and 
show that the rates of convergence in this case can be extremely slow. We prove 
that the log-periodogram regression (GPH) estimator achieves the optimal rate of 
convergence for Gaussian long memory processes 



1 Introduction 

Let be a weakly stationary process with autocovariance function 7. Its spectral 

density /, when it exists, is an even nonnegative measurable function such that 



Long memory of the weakly stationary process {Xt} means at least that the auto- 
covariance function is not absolutely summable. This definition is too weak to be 
useful. It can be strengthen in several ways. We will assume here that the spectral 
density is regularly varying at zero with index —a G (—1, 1), i.e. it can be expressed 
for a; > as 



where the function L is slowly varying at zero, which means that for all t > 0, 




X "i(a;) , 



lim^ 

x^o L{x) 



= 1 . 



Then the autocovariance function is regularly varying at infinity with index a — 1 
and non absolutely summable for a > 0. The main statistical problem for long 
memory processes is the estimation of the memory parameter a. This problem 
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has been exhaustively studied for the most familiar long memory models: the frac- 
tional Gaussian noise and the ARFIMA(p, d, q) process. The most popular estima- 
tors are the GPH estimator and t he GSE estimator, first introduced respectively by 
iGeweke and Porter- HudakI (|1983D and Kucnsch ( 198"^}^ Rigorous theoretical results 
for these estimators were obtained by Robinson ( 1995a 'b'), under an assumption of 
second order regular variation at 0, which roughly means that there exists C, p > 
such that 

f{x)^Cx-'^{l + 0{x'')} . 

Under this assumption, iGiraitis et al.l (|l997f ) proved that the optimal rate of con- 
vergence of an estimator based on a sample of size n is of order u'^p/C^p+'^'i , 

The methodology to prove these results is inspired from similar results in tail 
index estimation. If _F is a probability distribution function on (— cx3,oo) which is 
second order regularly varying at infinity, i.e. such that 

F{x) ^Cx-^ll + Oix-"")} 

as a; — > oo, then iHall and Welihl { 19841 ) proved that the best attainable rate of 
convergence of an estimator of the tail index a based on n i.i.d. observa t ions d rawn 
from the distribution F is of order n''P^'^'^P+^\ In this context. iDreesI l|l998l ) first 
considered the case where the survival function F is regularly varying at infinity, 
but not necessarily second order regularly varying. He introduced very general classes 
of slowly varying functions for which optimal rates of convergence of estimators of 
the tail index can be computed. The main finding was that the rate of convergence 
can be extremely slow in such a case. 

In the literature on estimating the memory parameter, the possibility that the 
spectral density is not second order regularly varying has not yet been considered. 
Since this has severe consequences on the estimations procedures, it seems that this 
problem should be investigated. In this note, we parallel the methodology developped 
bv iDree j (|l998l ) to deal with such regularly varying functions. Not surprisingly, we 
find the same result, which show that the absence of second order regular variation 
of the spectral density has the same drastic consequences. 

The rest of the paper is organised as follows. In Section (2] we define the classes 
of slowly varying functions that will be considered and prove a lower bound for the 
rate of convergence of the memory parameter. This rate is proved to be optimal 
in Section [S] An illustration of the practical difficulty to choose the bandwidth 
parameter is given in Section |4] Technical lemmas are deferred to Section (5] 



2 Lower bound 

In order to derive precise rates of convergence, it is n ecessary to re strict attention 
to the class of slowly varying functions referred to bv lDreej lll998l) as norma lised. 



This class is also referred to as the Zygmund class. Cf. ( Bingham et al.l . ll989l . Sec- 
tion 1.5.3) 

Definition 1. Let rj* he a non decreasing function on [0,7r], regularly varying at 
zero with index p > and such that Mmx^o rj* (x) — 0. Let SV{ri*) be the class of 
even measurable functions L defined on [— vr, tt] which can be for expressed x > as 
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L{x) = L{n) exp <; - / ^ ds 



s 



for some measurable function rj such that \rj\ < rj* . 

This representation implies that L has locally bounded variations and 77(3) — 
sL'{s)/L{s). Usual slowly varying functions, such as power of logarithms, iterated 
logarithms are included in this class, and it easy to find the corresponding rj function. 
Examples are given below. We can now state our main result. 

Theorem 1. Let rj* be a non decreasing function on [0,7r], regularly varying at 
with index p> and such that lima;_,o 77*(a;) = 0. Let tn be a sequence satisfying 

lim = 1. (1) 

n — >-oo 

Then, if p > 0, 



and if p = 



liminfinf sup sup Ea,L[??*(tn) \an — a\] > Q , (2) 

ri^oo dn LeSV(j7*) qS(-1,1) 



liminfinf sup sup Ea.L [??* (tn) ""^jon — aj] > 1 , (3) 

n^oo a„ LgSV(i7*) ae(-l,l) 



where Pa,L denotes the distribution of any second order stationary process with spec- 
tral density x~"L{x) and the infimum infa„ is taken on all estimators of a based 
on n observations of the process. 

Example 1. Define 7?*(s) = Cs^ for some /3 > and C > 0. Then any function 
L e SViv") sa1. i sfies L (x) = L{0) + 0{x>^), and we recover the case considered 
bv lGiraitis et al.1 (|l997l ). The lower bound for the rate of convergence is n'^^^'^''"'"^'. 

Example 2. For p > 0, define ri*{s) = p/log(l/s), then 

"i/i! ri*(s) 

exp / — ds ^ = exp {plog log(l/2;)} = log''(l/2;) 



A suitable sequence t„ must satisfy p^/log^(f„) « nt„. One can for instance 
choose t„ = log^(n)/(np^), which yields rj*{t„) — p/log(n){l + o(l)}. Note that 
ri{s) — p/log{s) belongs to 51^(77*), and the corresponding slowly varying function 
is log~''(l/a;). Hence, the rate of convergence is not affected by the fact that the 
slowly varying function vanishes or is infinite at 0. 

Example 3. The function L[x) — logTog(l/a;) is in the class SV{r]'') with rj*{x) — 
{log(l/x) log log(l/x)}^^. In that case, the optimal rate of convergence is log(n) log log 
Even though the slowly varying function affecting the spectral density at zero di- 
verges very weakly, the rate of convergence of any estimator of the memory param- 
eter is dramatically slow. 
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Proof of Theorem [T] Let £ > 0, tn he a sequence that satisfies the assumption of 
Theorem [TJ and define a„ = 77*(tt„) and 

ifO<s<^i„, 
a„ if £t„ < s < TT, 



Ln{x) = tt"" exp 



rj„{s) ds 



Since rj* is assumed non decreasing, it is clear that L„ £ SV{Tj*). Define now /„ (x) — 
x~"" Ln{x) and — {f^)~^- /„ can be written as 

r (tt„/a;)"" if < X < lt„, 
\ 1 if £t„ < a; < TT. 

Straighforward computations yield 

r 

{/n (2:) - fn{^)f dx = 8ft„a^(f + 0{al)) = 8£n-'{l + o(l)). (4) 



The last equality holds by definition of the sequence t„. Let P,7 and P„ denote the 
distribution of a n-sample of a stationary Gaussian processes with spectral densities 
et respectively, E~ and the expectation with respect to these probabilities, 
— — the likelihood ratio and A„ — > rj for some real r G (0, 1). Then, for any 

d¥~ dP„ — ' \ ' / 1 J 

estimator &„, based on the observation {Xi, . . . ,X„), 

supEc«,L[ld„ - a]] > i (E^ [|q„ - a„|] + E;;[|d„ + a„|]) 



1a„ On + On + — — 1a„ On — ftn 



> -E„ [{|a„ + a„| + r|a„ - a„|}lA„] > Ta„P„ {A^). 

Denote e = log(l/r) and = log ( dP+/ dP~). Then P^(^„) = 1 - P;:(/ln < 
— e). Applying and ()Giraitis et al. . [l997l . Lemma 2), we obtain that there exist 
constants Ci and C2 such that 

E^ [yl„] < Ci^ , E- - m^)"] < Ca^ . 

This yields, for any > and small enough £, 

Pn (^n) > 1 - e"^E[yl^] > 1 - C/e"^ > 1 - ?? . 

Thus, for any rj,T £ (0, 1), and sufficiently small £, we have 



liminf inf inf lS,a lIti* (tn) |a„ — all 

n->oo LgSV(?7*) a6{-l,l) ' u ^ ' i 



> r(l — ?7) lim 



rr{£t„ 
V* {in 



r(l -77)^" 



This proves ^ and Q. 
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3 Upper bound 

In the case ri*{x) = Cx'' with p > 0, iGiraitis et al.l (|l997l ) have shown that the 
lower bound ^ is attainable. The extension of their result to the case where 77* is 
regularly varying with index p > (for example to functions of the type a;'' log(a::)) 
is straightforward. We will restrict our study to the case p — 0, and will show that 
the lower bound is asymptotically sharp, i.e. there exist estimators that are rate 
optimal up to the exact constant. 

Define the discrete Fourier transform and the periodogram ordinates of a process 
X based on a sample Xi, . . . , X„, evaluated at the Fourier frequencies Xj = 2jn/n, 
j = 1, . . . ,n, respectively by 

n 

dx,j = {2TTn)~^^^'^Xte~'*'^^ , and Ix,j ^ \dx.jf ■ 
t=i 

The frequency domain estimates of the memory parameter a are based on the fol- 
lowing heuristic approximation: the renormalised periodogram ordinate Ix,j / f{xj), 
^ ^ j ^ n/2 axe approximately i.i.d. standard exponential random variables. Al- 
though this is not true, the methods and conclusion drawn from these heuris- 
tics can be rigourously justified. In particular, the Geweke and Porter-Hudak 
(G PH) and Gaussian semiparametr ic est imator have be en respectively proposed 
by iGeweke and Porter-HudakI (Il983l ) and iKuenschI (|l987l ) , and a theory for them 



was obtained bv lRobinsonI ([l995b||aD in the case where the spectral density is second 



order regularly varying at 0. 

The GPH estimator is based on an ordinary least square regression of \og{Ix,k) 
on log(fc) for fc = 1, . . . , m, where m is a bandwith parameter: 

m 

(A(m), (7) = arg min {log{Ix,k) - C + a log(fc)}^ . 

fc = l 

The GPH estimator has an explicit expression as a weighted sum of log-periodogram 
ordinates: 

m 

&{m) = -Sm^ t^m.fc log(J_y,fc), 

with ly^^k = log(fc) - m'^ T,"Li ^°s{j) and = Yl^^^i ^li,k = + 

Theorem 2. Letrj* be anon decreasing slowly varying function such that linix^o V* {^) 
0. Let Ea,L denote the expectation with respect to the distribution of a Gaussian pro- 
cess with spectral density x~"L{x). Let tn be a sequence that satisfies {!]) and let m 
be a nan decreasing sequence of integers such that 

lim m^^^ri* {t„) = 00 and lim ^ J' = 1 . (5) 

n— >oo n^aDr]*[m/n) 

Assume also that the sequence m can be chosen m such a way that 



log(m) , s ^ri*(s)ds 

hm — = . (6) 

n-»oo mri*(m/n) 



Then, for any 5 £ (0, 1), 



limsup sup sup rf {t„) Ea,L[(Q(m) — a) ] < 1. (7) 

n^oo |a|<i5 L6SV(?7*) 
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Remark 1. Since i]* is slowly varying, it is always possible to choose the sequence m 
in such a way that ([5]) holds. Condition ((6)1 ensures that the bias of the estimator is 
of the right order. It is very easily checked and holds for all the examples of usual 
slowly varying function rj*, but we have not been able to prove that it always holds. 

Since the quadratic risk is greater than the risk, we obtain the following 
corollary. 

Corollary 1. Let S £ (0, 1) and r;* be a non decreasing slowly varying function 
such that lim^^^o '7*(a;) = and such that it is possible to choose a sequence m that 
satisfies Then, for in as in {!]), 

liminfinf sup sup Ea, lIv* i'tn)^^\&n ^ ct\] — 1 . (8) 

Remark 2. This corollary means that the GPH estimator achieves the optimal rate 
of convergence, up to the exact constant over the class SV{ri*) when ry* is slowly 
varying. This implies in particular that, contrarily to the second order regularly 
varying case, there is no loss of efficiency of the GPH estimator with respect to the 
GSE. This happens because in the slowly varying case, the bias term dominates 
the stochastic term if the bandwidth parameter m satisfies ((S)). This result is not 
completely devoid of practical importance, since when the rate of convergence of an 
estimator is logarithmic in the number of observations, constants do matter. 

Example 4 (Example\^continued). If L(a;) — log''(l/x)L(a::), where L £ SV{Cx'^) for 
some p>, P > and C > 0, then J^^i i^m,fc log(L(a;fe)) ~ pmlog'^ (xm)- Choosing 
m = log'^"'"''(n) yields (O, © and log{n){a{m) — a) converges in probability to p. 

Proof of Theorem\M Define £k = log{x'^Ik/L{xk)}. The deviation of the GPH 
estimator can be split into a stochastic term and a bias term: 

m m- 

a(m) - a = -s~^ ^ T^m.,k£k - ^ i^ni.k log{L{xk))- (9) 

k = l k = l 

Applying Lemma [2j we obtain the following bound: 

^[{Y^'^m,klog{£k)} ] <C((5,77*)m. (10) 

k = l 

The bias term is dealt with by applying Lemma [T]wich yields 

m 

I ^J^m,felog(L(3;fc))| < mr?*(2;™^){l + o(l)}, (11) 
uniformly with respect to \ri\ < rf . Choosing m as in (O yields ((TJ. □ 



4 Bandwidth selection 

In any semiparametric procedure, the main issue is the bandwidth selection, here the 
number m of Fourier frequencies used in the regression. Many methods for choosing 
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m have been suggested, all assuming some kind of second order regular variation 
of the spectral density at 0. In Figures [!]■ |3] below, the difficulty of choosing m 
is illustrated, at least visually. In each case, the values of the GPH estimator are 
plotted against the bandwidth m, for values of m between 10 and 500 and sample 
size 1000. 

In Figure [1] the data is a simulated Gaussian ARFIMA(0,d,0). The spectral 
density / of an ARFIMA(0,d,0) process is defined by /(x) = a^\l - e'"=|~^'*/(27r), 
where is the innovation variance. Thus it is second order regularly varying at 
zero and satisfies f{x) = x~"{C + 0{x^)) with a — 2d. The optimal choice of the 
bandwidth is of order n"^^^ and the semiparametric optimal rate of convergence is 
n^^^. Of course, it is a regular parametric model, so a -y/n consistent estimator is 
possible if the model is known, but this is not the present framework. The data in 
Figure [2] comes from an ARFIMA(0,d,0) observed in additive Gaussian white noise 
with variance r^. The spectral density of the observation is then 



It is thus second order regularly varying at and the optimal rate of convergence 
is 71^''/'''''+^', with optimal bandwidth choice of order . In Figures H and [2] 

the outer lines are the 95% confidence interval based on the central limit theorem 
for the GPH estimator of d = a/2. See iRobinsoiil (|l995b). 

A visual inspection of Figure [T] leaves little doubt that the true value of d is close 
to .4. In Figure [2j it is harder to see that the correct range for the bandwidth is 
somewhere betwen 50 and 100. As it appears here, the estimator is always negatively 
biased for large m, and this may lead to underestimating the value of d. Methods 
to correct this bias (when the c orrect mod el is known) have b een proposed and 
investigated bv lHurvich and Ravi (,2003. ) and lHurvich et all (|2005l '). but again this is 
not the framework considered here. 

Finally, in Figure |3] the GPH estimator is computed for a Gaussian process with 
autocovariance function ■y{k) = l/(fc+l) and spectral density log |1 — e'^p. The true 
value of a is zero, but the spectral density is infinite at zero and slowly varying. The 
plot d(m) is completely misleading. This picture is similar to what is called the Hill 
"horror plot" in tail index estimation. The confidence bounds are not drawn here 
because there are meaningless. See Example 3] 

There has recently been a very important literature on methods to improve the 
rate of convergence and/or the bias of estimators of the long memory parameter, 
always under the assumption of second order regular variation. If this assumption 
fails, all these methods will be incorrect. It is not clear if it is possible to find 
a realistic method to choose the bandwidth m that would still be valid without 
second order regular variation. It might be of interest to investigate a test of second 
order regular variation of the spectral density. 



^2 ^2 2 ( J2 

^ I, ill— 2d . ^ II ix\—2d I 1 I ^ n ill 



\x\2d 



} 



5 Technical results 



Lemma 1. Let rf he a non decreasing slowly varying function on (0, tt] such that 
lims->o (s) = 0. Let -q be a measurable function on (0, vr] such that \r]\ < rj* , and 
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Fig. 2. GPH estimator for ARFIMA(0,.4,0)+ noise 



define h{x) = ^ ds and h*{x) — ^ j^"^ ds. Then, for any non decreasing 

sequence m < n, 

m 

'^Vm{k)h{xk) < mr;*(3;„) + 0(log^(m)r;*(x™,) + log(m)/i*(a;m)), (12) 
ft=i 

uniformly with respect to \rj\ < rj* . 

Proof. Since rj is slowly varying, the function h is also slowly varying and satisfies 
lima:^o v{^)/f^{^) = 0- Then, 
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Fig. 3. GPH "horror" plot 



Z]'*(^'=) = ^/ Ks)ds+^Y.I {Kxu) - h{s)} ds 



2-K 



Thus, for < rj* and rj* increasing, 
J^K^k) - ^ / h{s)ds 



<- *( ^ n r .1 



t 



2tv 

= I ^log(A:) - mlog(m) + i 

lfe=i 

= 0(ry*(a;m)log(m)) . 
By definition, it holds that xh'{x) — ri{x). Integration by parts yield 



n_ r 

Jo 



h{s) ds = mh{xm) / 77(3) ds 



Thus, 



1 

Similarly, we have: 



'^h{xk) = mh{xm) ~ ^ I ri{s)ds + 0(log{m)ri*(xm)) 
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y2 h{xk) log{x) ^ mh{xm) iog{x,n) - — / {77(3) log(s) + /!,(s)} da; 
fc=i '^'^ Jo 

+ 0(log(m){log(s™)77*(2;™) + ft* (a;™)}) . (15) 

By definition of Vm{k), we have: 

^ m 

i^,„(fc) = log(fc) V log(j) = log{xk) - log(a;„) + I + O1 

m ^ — ' 

j=i 

Hence, applying (|13p . (|14p and (|15|) . we obtain 

m 

y~^t^m(fc)fe(xfc) 



log(m) 



^log(a;ft)/i(a;fc) - {log(j:m) - 1 + 0(log(m)/m)} h{xi,) 

k=l fc=l 

n 

+ 0(log(m){77*(Km) log(a;„) + h*{xm)}) 



^(s){log(a;m) - Iog(s)}ds 



Finally, since I??! < r/* and 77* is non decreasing, we obtain 
v{s){^og{xm) - log(s)} 



n 
2^ 



ds = mrfixm)- 



This yields (HU. 



Lemma 2. Letr/* be a non decreasing slowly varying function such that limx—,o f]* {x) 
0. Let X be a Gaussian process with spectral density f(x) — x~"L{x), where 
a G [—5,5] CLnd L G SV{ri*). Let 7 = 0, 577216... denote Euler's constant. Then, for 
all n and all k,j such that < Xk, Xj < it/2, 



E[log='(f,)] - y < C(<5,77*)log(l + fc)fe-\ 



!E[log(£:fe)]-7l + 

lE[log(£:0 -7)(log(£,) -7)]| < C{S,ri')log\j)k-\ 
Proof of Lemma [2] 

It is we ll know n (see for instance iHurvich et all l|l998h . lMoulines and Souliej (|l999l ). 
ISouliej l|200j )') that the bounds of Lemma [2] are co nsequences of the covariance 
inequality for functions of Gaussian vectors of (jArcon cs. 1994, Lemma 1) and of the 
following bounds. For all n and all k,j such that < |a:fc| < < 7r/2, 



\cov{dx,k,dxj)\ + \cov{dx,k,dx,j) ~ fixk)Sk,j\ 

<C{5,r^'Wfixk)f{x,) log{j)k-' 



Best attainable rates of convergence 



11 



Such bounds have been obained when the spectral density is second order regularly 
varying. We prove these bounds under our assumptions that do not imply second 
order regular varition. Denote Dn{x) = {2Tm)~^^^ X]t=i e~'*^. Then 

f{x)D„{xk - x)D„{xj + x)dx. 

Recall that by definition of the class SV{rj*), there exists a function rj such that 
I*?! ^ V* L{x) — L{tv) exp{— s~^ri{s)ds}. Since only ratio L{x)/L{tv) are 
involved in the bounds, without loss of generality, we can assume that L{n) — 1. We 
first prove that for all k such that Xk < 7i"/2, 

' ( 4r^ - A \Dn{xk - x)\^ dx <C(<5,77*)log(fc)fc-^ (16) 

Since L G SV{i]*), the functions x'^L{x) and x'^L^^{x) are bounded for any e > 
and 

sup x\L{x)+L-''{x))<C{v'',S) , (17) 

fix)dx < ci-n^s) . (18) 

Since rj* is increasing, for all < 2: < y < tt/2, it holds that 

\f{x)-fiy)\ = \x-"Lix)~y-"Liy)\ 
rv 

< I |« — 77(s)|s^"^ L{s)ds 

J X 

< /"(l + r?*(7r))s-"-'L(s)ds . 

Since a G [—1 + (5, 1 — 6], x~^~'^L{x) is decreasing. Hence 

\f{x)~f{y)\<C{r,\d)x-'f{x){y~x). (19) 
Define Fn{x) = \Dn{x)\'^ (the Fejer kernel). We have 

/TT 
F„{x)dx^l, (20) 

D„{y + x)D„{z -x)dx= {2imy^''^Dr,{y + z) . (21) 

From now on, C will denote a generic constant which depends only on ?7*, S and 
numerical constants, and whose value may change upon each appearance. Applying 
ifTf)) . (fT8)) and ((20|, we obtain 

f \rHxk)f{x) - l\Fn{x^Xk)dx < Cn-'-{f-\xk) + l) < Ck'^ . 

J IT /2<\x\<TT 

The integral over [— 7r/2,7r/2] is split into integrals over [— 7r/2, —Xk/2\ U \2xk,'n /2\, 
\-Xk/2, Xk/2\ and [xk/2, 2xk\. If a; G [-7r/2, -Xk/2\ U [2x^,71/2], then F„(x - Xk) < 
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Cn~^x~^. Hence, applying Karamata's Theorem (cf. (|Bingham et al. I, Il989l . Theo- 
rem 1.5.8)), we obtain: 



/ — a;fc/2 r — Xf^/^ poo 

+ / F„{x) dx < Cn-^ / x-^ dx < CrT^xl^ < Ck'^ 

-7r/2 Jxi,/2 



+ / f{x)Fr^{x)dx 
n/2 J-Tv/2 

f-Tr/2 

< Cn' / L{x)dx < Cn~ x^°'~ L{xk) < Ck~ f{xk) 

-Xk/2 r-Xk/2 

+ / ^ 

t/2 J-tv/2 Jxk/2 



For X G [~Xk/2, Xk/2], Fn{xk — x) < n ^x^ '^. Thus, applying again Karamata's 
Theorem, we obtain; 

f{x)Fn{x — Xk)dx<Cn Xf, I x °'L{x)dx 

-Xk/2 J-Xk/2 

< Cn'~^xl'^xl°'^^L{xu) < Ck'^ f{xk) , 
Fn{x — Xk)dx < Cn~ x^^ < Ck~ . 

-^k/2 

Applying (|19p and the bound /^^^/j '^x < Cn~^ log(fc), we obtain: 

2xf. 

\f{x) - f{Xk)\Fn{x - Xk) dx 

r^^k 

< Ca;-"-^L(xfc/2) / \x - Xk\F„{x - Xk)\dx < Cf{xk)k-^ log(fc) . 

Jxk/2 

This proves (|16p . We now prove that all k,j such that Q < Xk ^ \xj \ < n /2, 
( fl^) ^ ^"'•^'^ ^ x)D„(xj — x) dx 
\f{xk) 



+ I (4T^-^]Dn{xk-x)D^{xj~x)dx 



<C(5,77*)log(fcV|j|)(fcAljl)-\ (22) 



Define E„^k,j{x) := D„{xk — x)D„{xj —x). Since < Xk,xj < n/2, for 7r/2 < |x| < 
TT, we have \En,k,j{x)\ < Cn^^ . Hence, as above, 

\f'^{xk)fix) - 1| dx < Cn~'^{xkL-^{xk) + 1) < Cfc"\ 

/2<]x\<TT 

We first consider the case k < j and we split the integral over [— 7r/2,7r/2] into 
integrals over [-■K/2,-Xk/2\ U [2a;,,7r/2], [-2:^/2, 2:^/2], [xk/2, {xk + Xj)/2], [{xk + 
Xj)/2, 2xj], denoted respectively Ii, I2, Is and I4. 

• The bound for the integral over [—7r/2, —2:^/2] U [2xj,n/2] is obtained as above 
(in the case k = j) since \E„^k,i\ < Cn^^x^^ . Hence < Ck^^ . 

• For x £ [—Xk/2, Xk/2], \En^k,j{x)\ < Cn~^x~^^, hence we get the same bound: 
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• To bound I3, we note that on the interval [xk/2, {xk + Xj)/2], 

\E„,k,j{x)\ < Cn^/\j - k)\Dn{x ^ Xk)\, 
and n^/^jx — xt\\D„{x — Xk\ is uniformly bounded. Hence, applying (I19p . we obtain 

\h\ < c{j - ky'x^'x, < ck-\ 

• The bound for I4 is obtained similarly: IJ4I < Ck~^ log(j). 

• To obtain the bound in the case Xj < Xk, the interval [—tt, vr] is split into 
h7r,-7r/2]u[7r/27r] [--k /2,-Xk/2]yj[2xk,-K /2], [-Xk/2,Xj /2], [x, /2,{xk+Xj) /2\ and 
[(xfc -\- Xj) /2,2xk\. The arguments are the same except on the interval [— Xfc/2, Xj/2] 
where a slight modification of the argument is necessary. On this interval, it still 
holds that |-E„,fc,j(j:)| < n~^a;^'^. Moreover, x^ L{x) can be assumed increasing on 
[0,a::fc/2], and we obtain: 

/xj/2 r^jf^ 
x~°'L{x)dx < XkL{xk) I x~"~ dx < Cx^"'^ L{xk) ■ 

The rest of the argument remains unchanged. 

• To obtain the bound in the case Xj < < Xk, the interval [— tTjTt] is split 
into [-TT, -7r/2] U [7r/2, tt] h7r/2, 2xj] U [2a;fe, 7r/2], [2x.i , -Xk/2], [-Xk/2,Xk/2] and 
[xk/2, 2xk] and the same arguments are applied. □ 
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